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PATENT APPLICATION 
Transformation System Based on the Integrase Gene and Attachment Site 
for Myxococcus xanthus Bacteriophage Mx9 

CROSS-REFERENCE TO RELATED APPLICATIONS 
[0001] This application claims benefit of U.S. provisional patent application no. 

60/405,196, filed August 21, 2002, the entire contents of which are incorporated by 
reference. 

FIELD OF THE INVENTION 
[0002] The invention relates to methods and materials for transforming host 

bacteria] cells using a bacteriophage Mx9 system. The invention finds application in the 
fields of molecular biology and dmg development. 

BACKGROUND OF THE INVENTION 
[0003] Mx9 is a general transducing phage that infects the Gram-negative 

bacterium Myxococcus xanthus (9). The phage particle has a polyhedral head vidth a very 
short tail. Stmcturally it resembles Mx8, which also infects M. xanthus. 
[0004] The integrase gene and attachment site for Mx8 have been characterized 

(7, 8, 1 1). Integration of Mx8 by site-specific recombination requires a single phage 
protein, Int, and the phage attachment site, a^/P. Unlike most temperate bacteriophage, 
the Mx8 attP site is contained within the int gene and upon insertion into the M xanthtds 
chromosome, the 3' end of the int gene is altered. This modified int gene produces a 
protein, IntX, with lower specific integrase activity (8). 

[0005] Because no natural replicating plasmids have been identified for M 

xanthus, or for any other myxobacteria, phage attachment sites provide an efficient and 
stable alternative for introducing new genes or adding additional copies of existing ones 
into the cell. The Mx8 int and attachment site can be used to integrate DNA into the 
chromosome, but expression of many genes is affected by insertion into the Mx8 attB 
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sites; many developmental as well as two constitutive promoters, mgl ondpilA, have 
reduced activity at this site (2, 6). There remains a need for more effective and reliable 
transformation systems that will enable insertion of DNA into the chromosome of M 
xanthus and other bacteria. The present invention meets these and other needs. 

SUMMARY OF THE INVENTION 
[0006] The present invention provides methods and materials for transfonning 

host cells using a bacteriophage Mx9 transformation system. In another aspect, the 
present methods, materials, host cells and vectors are directed to enhancing the 
production of a useful compound, including but not limited to a polyketide, through the 
introduction of one or more genes into the DNA of a variety of bacterial host cells. 
[0007] In one aspect, the invention provides a method for modification of a DNA 

of a bacterial cell comprising in its genome a first attachment site recognized by a protein 
with Mx9 integrase activity, comprising introducing a Mx9 transformation system into 
tlie cell, said system comprising (a) a gene encoding a protein with Mx9 integrase activity 
protein operably linked to a promoter active in the host cell, and (b) a DNA vector 
comprising a second attachment site recognized by the integrase protein, which may be 
the same as the first attachment site. 

[0008] These and other embodiments of the invention are described m more detail 

in the following description, examples, and claims set forth below. 



BRIEF DESCRIPTION OF THE FIGURES 
[0009] Figure 1 presents a physical map of the int region firom Mx9. Boxes 

represent putative open reading frames. The hatched box in int designates the position of 
attP. 

[0010] Figure 2 presents the nucleotide sequence of the Mx9 int gene [SEQ ID 

N0:1] and the deduced amino acid sequence [SEQ ID N0:2]. Amino acids are in one- 
letter code underneath the DNA sequence. The sequence in bold [SEQ ID NO:5] is the 
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Mx9 attP core site. Arrows represent inverted repeats. A previous version of this 
sequence had the following differences: 504 A->T and 505 G~>A. 
[0011] Figure 3 presents (A) Nucleotide sequence of the Mx9 attBl site [SEQ ID 

N0:3] and (B) Nucleotide sequence of the Mx9 attB2 site [SEQ ID N0:4]. Nucleotides 
in bold are the 42 bp [SEQ ID N0:5] identical in the Mx9 attP site. Underlined 
nucleotides encode tRNA^^^ Arrows; inverted repeat wathin attB2. (C) Nucleotide 
sequence of the native Mx9 attBl [SEQ ID N0:6]. Nucleotides in bold indicate the 
partial core sequence. (D) Nucleotide sequence of the attP site [SEQ ID NO: 7]. Arrows; 
inverted repeat. 

[0012] Figure 4 presents the predicted cloverleaf secondary structure for tRNAgly 

fromM xaiithus [bases 1397 to 1428 of SEQ ID NO:l]. The bases that are contained 
within the core attB sequence are outlined. 

[0013] Figure 5 shows an agarose gel of PGR amplified DNA fragments. Lanes 

1. 100 bp ladder from New England Biolabs. Lane 2. PGR amplification reactions for 
detection of attB2 in the wild type strain DZl. Lanes 3 and 4. PGR amplification 
reactions for detection of attB2 in two independent isolates that contain a plasmid 
integrated at attBL Lanes 5 and 6. PGR amplification reactions for detection of attB2 in 
two independent isolates that contain a plasmid integrated at attB2. 
[0014] Figure 6A shows the lacZ gene transcribed firom the pilA promoter 

integrated at the either the pilA chromosomal location, Mx9 attBl or attB2, or the Mx8 
attB sites. Figure 6B and Figure 6G show the lacZ gene transcribed from the mgl 
promoter integrated at the either the mgl chromosomal location, Mx9 attBl or attB2, or 
the Mx9 attB sites. 

[0015] Figure 7 shows the consensus sequence of a Chrysoperla carnea 

transposase gene [SEQ ID NO: 19]. 

DETAILED DESGRIPTION OF THE INVENTION 
[001 6] The present invention provides methods and materials for transforming 

bacterial cells using a bacteriophage Mx9 transformation system (also called an Mx9 
enzyme system). In one aspect, the invention provides an Mx9 transformation system 
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that may be used to introduce DNA into a bacterial cell comprising an attB site. The 
Mx9 transfonnation system comprises (1) a gene encoding a protein with Mx9 integrase 
activity and (2) a DNA vector comprising an attachment site {attP) recognized by the 
attachment site. The int gene product catalyses recombination between the attP and attB 
sites, resulting in integration of DNA sequences from the DNA vector. Proteins with 
Mx9 integrase activity, the attP site, and attB site are described in detail below. 
[0017] In one embodiment of the invention, the attB site comprises the 42-b core 

sequence [SEQ ID NO:5]. The attB site may further include at least a portion of the 
sequences flanking the attB I and/or attB2 site core sites (e.g., attR and attL, discussed 
below, which comprise portions of SEQ ID NOS: 3, 4 and 6). In an embodiment, the 
attP site comprises the 42-b core sequence [SEQ ID N0:5]. The attP site may further 
include at least a portion of the sequences flanking the core sequence, e.g., as shown in 
Figure 3D. In an embodiment, the protein with Mx9 integrase activity (hereinafter, ''int 
protein") is the product of the ifit gene having the sequence of SEQ ID N0:2, It will be 
apparent to the reader that the attB site, attP site and int protein used in the practice of the 
invention need not be identical to those of the naturally occurring Mx9-Myxococcus 
xanthus system and that the invention can be practiced using an having sequences 
substantially identical to those of the naturally occurring sequences. For example, the int 
protein can dififer from SEQ ID N0:2 by conservative amino acid replacements or other 
substitutions, so long as it has Mx9 integrase activity, i.e. catalyses recombination 
between attP and attB sites having the sequences of SEQ ID N0:7 and 4, respectively 
(see Figure 3). Conversely, the attP and attB sites can differ from naturally occurring 
sites (and may comprise only a fraction of SEQ ID N0:7, 3, 4, or 6), so long as they are 
recognized by the int protein having a sequence of SEQ ID N0:2. 
[0018] In one embodiment, the protein with Mx9 integrase activity has the 

sequence shown in Figure 2 [SEQ ID N0:2], or has a substantially identical sequence. In 
this context, substantial sequence identity means at least about 70%, more often at least 
about 80%, most often at least about 90% identity. Sequence identity can be calculated 
according to the method of Pearson and Lipman, 1988, Proc. NatL Acad, Sci. U.S.A. 
85:2444 using default parameters. In an aspect the invention provides an integrase 
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having the sequence shown in Figure 2 [SEQ ID N0:2] or having a substantially identical 
sequence and having integrase activity (e.g., when substrates are the sequence of attP and 
attB2 sites shown in Figure 3). In an aspect, the integrase is encoded by a DNA having 
the sequence of SEQ ID N0:1 or a substantially identical sequence, e.g., at least about 
70%, at least about 80%, at least about 90%, or at least about 95% identical (which can 
be calculated fornucleic acids using the method of Altschul, 1990, J. Mol BioL 215:403- 
10 using default parameters). In another aspect, the invention provides an isolated or 
recombinant DNA molecule comprising the sequence of SEQ ID N0:1 or a substantially 
identical sequence (e.g., at least about 70%, more often at least about 80%, most often at 
least about 90% identity). In a related aspect, the invention provides an isolated or 
recombinant DNA molecule comprising a sequence encoding SEQ ED N0:2 or a 
substantially identical sequence (e.g., at least about 70%, at least about 80%, or at least 
about 90% identity). In some embodiments the isolated or recombinant DNA is less than 
5000, less than 1000, less than 5000 or less than 2000 bases in length. In one aspect, the 
invention provides a recombinant vector comprising an integrase encoding gene. In an 
embodiment, the gene is operably linked to a promoter that functions in a host cell, so 
that upon introduction into a cell the integrase is expressed in a host cell. 
[0019] In an aspect, tlie attP and attB sites comprise the 42-base core sequence, 

and may also comprise at least about 10, at least about 20, at least about 30, at least about 
40, at least about 50, at least about 100, or all, of one or more of the flanking sequences 
shown for attP, attBl or attB2 in Figure 3 [e.g., SEQ ID N0S:7, 3, and 4 respectively], or 
a substantially identical sequence. The attB and attP core sequences may be sufficient 
for recombination. Alternatively, at least a portion of the flanking sequence(s) may be 
necessary for recombination or improve recombination frequency. The precise extent of 
sequence required for efficient recombination can easily be determined using routine 
assays for recombination using a series of constructs comprising different amounts of 
sequence, 

[0020] In an aspect, the invention provides an isolated or recombinant DNA 

molecule comprising a sequence selected from a sequence comprising the Mx9 attB J site 
[SEQ ID N0:3]; the Mx9 attB2 site [SEQ ID N0:4]; the Mx9 native attBl site [SEQ ID 
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N0:6], the attR site ofattBI [nucleotides 205-360 of SEQ ID N0:3], the attR site of 
anB2 [nucleotides 207-360 of SEQ ID NO:4], the attL site ofattBI [nucleotides 1-162 of 
SEQ ID N0:3] or the attL site of attB2 [nucleotides 1-164 of SEQ ID N0:4], or, 
alternatively, at least about 10, at least about 20, at least about 30, at least about 40, at 
least about 50, at least about 100, from, or all of, an aforementioned sequence. In some 
embodiments the isolated or recombinant DNA is less than 5000, less than 1000, less 
than 500 or less than 200 bases in length. In an aspect, the invention provides an isolated 
or recombinant DNA molecule comprising a 42 base sequence corresponding to 
nucleotides 165-206 of SEQ ID N0:4, i.e., SEQ ID N0:5. In an aspect, the invention 
provides an isolated or recombinant DNA molecule comprising an attP sequence. In one 
embodiment the attP sequence consists of or comprises SEQ ED NO:5, or alternatively, 
SEQ ID N0:7, or at least 50, at least 100, or at least 150 bases of SEQ ID N0:7 
(generally including the core sequence). The invention provides recombinant vectors 
comprising any of the aforementioned DNA molecules. 

[0021] In one aspect the attB and attP sites comprise identical sequences, e.g., 42 

base pair core sequences. In an embodiment, the attB site is located within the 5 'region 
of the tRNA®'^ gene of the host cell. In another aspect, the one or more attB sites are 
comprised ofattBI and/or attB2. In an embodiment, the present invention provides 
methods wherein the target DNA for the Mx9 transformation system comprises flanking 
sites attR and attL, and the integrase protein, when expressed, is an enzyme that 
facilitates site-specific recombination through binding to the attP and attB sites. 
[0022J The int gene and attP site may be situated on the same vector. However, 

the integrase can function in trafis and, accordingly, the sites can be introduced on 
different vectors. In another embodiment of the invention, the vector comprising an attP 
site is introduced into a recombinant cell expressing the int gene (e.g., a cell stably 
transformed with int protein encoding gene). As used herein, *Vector" has its usual 
meaning in the art, and refers to polynucleotide elements that are used to introduce 
recombinant nucleic acid into cells for either expression or replication. Exemplary vector 
classes include recombinant DNA or RNA constructs, such as a plasmid, a phage, 
recombinant virus or other vectors. An "expression vector" is a vector capable of 
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expressing DNAs that are operatively linked with regulatory sequences, such as promoter 
regions. It will be appreciated by those of skill that the vectors may contain additional 
elements for selection (e.g., antibiotic resistance markers), cloning (e.g., polylinkers), 
replication, and the like. Appropriate expression vectors are well known to those of skill 
in the art and include those that are replicable in prokaryotic cells, and those that remain 
episomal or those which integrate into the host cell genome (the term "host" cell refers to 
the cell into which the attP containing vector is introduced). It will be appreciated that a 
naturally occurring (non-recombinant) Mx9 phage is not itself a vector, although a 
recombinant Mx9 phage modified to carry a heterologous DNA would be considered a 
vector. 

[0023] The integrase gene of the Mx9 transformation system is operably linked to 

a promoter that fimctions in the intended host. Numerous prokaryotic, viral and synthetic 
promoters are known in the art and include, for example act promoters, tcm promoters, 
promoters derived from sugar metabolizing enzymes, such as galactose, lactose (Jac) and 
maltose, promoters derived from biosynthetic enzymes such as for tryptophan {trp\ the 
(J-lactamase (6/a), bacteriophage lambda PL and T5, synthetic promoters, such as the tac 
promoter (U.S. Patent No. 4,551,433), and mariner-type promoters may be used 
Exemplary promoters for Myxococcus cells include the native int gene promoter, the pilA 
promoter and the mgl promoter (see Wu and Kaiser, 1997, "Regulation of expression of 
the pilA gene in Myxococcus xanthus'' J. BacterioL 179:7748-7758 and GenBank 
accession number AF377950). 

[0024] The methods of the present invention may be used to transform any of a 

variety of host cells tliat comprise an attB attachment site recognized by the int gene 
product. Importantly, cells that lack a required integration or attachment site can be 
genetically engineered to contain one or more such sites, and the integrase gene can be 
placed under the control of a desired promoter. Thus, the invention can be applied to 
virtually any host cell. The invention is particularly suited for Myxobacteria, such as 
Sorangium or Myxococcus. In certain embodiments, the host cells of the present 
invention may be Sorangium cells (e.g., Sorangium cellulosum), Myxococcus cells (e.g.. 
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Myxococcus xanthus), Cystobactera, bacteria of order Stigmatella (e.g., S. erecta and S, 
aurantiaca), Pseudomonas cells, or Streptomyces cells. 

I0025J Methods for introducing the recombinant vectors and exogenous DNA 

molecules of the present invention into suitable hosts are known to those of skill in the art 
and typically include the use of CaCb or other agents, such as divalent cations, 
lipofection, DMSO, protoplast transformation, conjugation, or electroporation. 
References herein to "transformation" and its grammatical equivalents is intended to 
encompass any method of introducing an exogenous DNA into a cell. 
[0026] In one aspect, the present invention is directed to methods of transforming 

deoxyribonucleic acid (DNA) into a bacterial host cell to effectuate or improve 
polyketide expression. In one embodiment, the method comprises a) introducing a gene 
to the DNA of a bacteriophage Mx9 transformation system, said system comprising a 
gene encoding an integrase protein (int) and an attachment site (attP); b) introducing said 
bacteriophage Mx9 transformation system to a host cell that contains a nucleotide 
sequence encoding a polyketide and one or more integration sites (attB) located in the 
DNA of said host cell; and c) transforming said host cell with said gene by site-specific 
recombination at the one or more attB sites. 

[0027] As noted, the invention provides materials and methods useful for 

insertion of a gene or genes into a host cell, even if that host cell lacks an Mx9 
attachment site. Thus, in accordance with the methods of the invention, such host cells 
can be modified to include the required attachment site. One usefiil method for modifying 
host cells to include an Mx9 attachment site is transposon-based transformation (see 
provisional patent appHcation no. 60/403,290 (filed August 13, 2002) and U.S. patent 

application no. 10/ , filed August 13, 2003, entitled "Transposon-Based 

Transformation System," having attorney docket number 30062-2009800). In one 
embodiment, a transposon vector comprising (1) inverted terminal repeat sequences 
(ITRs) comprising the sequence ACAGGTTGGCTGATAAGTCCCCGGTCT [SEQ ID 
N0:17] GGATCCAGACCGGGGACTTATCAGCCAACCTGT [SEQ ID N0:18] and 
(2) a gene encoding a transposase having a sequence shown in Fig. 7, optionally 
comprising an E137K mutation, operably linked to a T7A1 promoter (Lanzer et al., 1988, 
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Proc, Nat'lAcadSci 85:8973-77) is used. In one embodiment, an attB site is introduced 
into a bacterial cell genome by a) transforming the cell with a transposon vector 
comprising inverted repeat sequences and a nucleotide sequence comprising a 
bacteriophage Mx9 integration site (attB), whereby the transposon vector transposes into 
the DNA of said cell; b) introducing a gene to the bacteriophage Mx9 transformation 
system, said system comprising a gene encoding an integrase protein (int) and an 
attachment site (attP); c) introducing said bacteriophage Mx9 transformation system to a 
host ceil; and d) transforming said host cell with said gene by site-specific recombination 
at said attB site. Li one aspect, the invention provides a method for a) transforming a cell 
that contains a nucleotide sequence encoding a polyketide synthase with a transposon 
vector comprising inverted repeat sequences and a nucleotide sequence comprising a 
bacteriophage Mx9 integration site (attB), whereby the transposon vector transposes into 
the DNA of said cell; b) introduciiig a gene into a bacteriophage Mx9 transformation 
system, said system comprising a gene encoding an integrase protein (int) and an 
attachment site (attP); c) introducing said bacteriophage Mx9 transformation system to a 
host cell; and d) transforming said host cell with said gene by site-specific recombination 
at said attB site. 

[00281 ^ another aspect, vectors useful for introducing genes into host cells 

containing an Mx9 integration site are provided. In a particular aspect, vectors of the 
present disclosure include (1) vectors (including bacteriophage and plasmid vectors) 
comprising DNA encoding an Mx9 phage attachment site (attP), and another gene, and 
(2) vectors comprising DNA encoding an integrase protein, an Mx9 phage attachment 
site (attP), and another gene. The other gene can be any DNA sequence that is desired to 
be introduced into the target cell, whether encoding a protein or not. As described 
below, in some embodiments, the gene changes or improves polyketide production in a 
polyketide producing cell. 

[0029] In another aspect, the present invention provides host cells, including e.g, , 

M. xanthtis host cells, comprising genes introduced by the described methods. In one 
embodiment, the present methods, materials, host cells and vectors are directed to 
enhancing the production of a useful compound, including but not limited to a polyketide. 
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through the introduction of one or more genes into the DNA of a variety of bacterial host 
cells. Thus, in one aspect, transformed host cells are provided that are produced by the 
claimed methods, which host cells comprise one or more genes integrated to effectuate or 
improve polyketide expression by the cell. For example, M. xanthus may be used, for 
example, for the production of epothilone (4; US Pat. No. 6,410,301 "Myxococcus host 
ceils for the production of epothilones") and genes may be introduced into such 
epothilone-producing cells to affect the amount, structure or other characteristics of the 
polyketide produced. In one embodiment, host cells of the present invention are 
epothilone-producing cells, wherein the epothilone produced is generally selected from 
epothilone A, B, C, and D. 

[0030] In one aspect, a gene that improves polyketide production upon functional 

integration into the DNA of a host cell is introduced into a cell that expresses, or can be 
engineered to express, a polyketide synthase. In one aspect, the genes introduced into a 
host cell by the methods of the invention comprise an operon of a prpE gene, accA^ and 
pccB genes to produce increased quantities of malonyl-CoA and/or methylmalonyl-CoA. 
The genes can be under the control of a suitable promoter, such as a PKS promoter, z.e., 
from epothilone (U.S. Pat. No. 6,303,342; U.S. Patent Application Serial No. 09/957,483, 
filed September 19, 2001), soraphen (U.S. Pat. No. 5,716,849, incorporated herein by 
reference), or tombamycin (U.S. Patent Application Serial No. 09/942,025, filed August 
28^ 2001, and U.S. Pat Nos. 6,280,999, and 6,090,601, each of which is incoiporated 
herein by reference) gene clusters. The gene or genes are inserted in a recombinant 
bacteriophage Mx9 of the invention and then integrated into the DNA of the host cell. In 
one aspect the prpE gene, accA, and pccB genes are inserted into a Myxococcus xanthus 
cell. 

[00311 In ianother aspect, the genes inserted into the host cell may comprise a 

matB gene or an operon comprising matB and matC genes, such as those from Rhizobium 
leguminosarum bv. trifolii, which respectively encode a ligase that can attach a CoA 
group to malonic or methylmalonic acid and a transporter molecule to transport malonic 
or methylmalonic acid into the host cell respectively, to produce increased quantities of 
malonyl-CoA and methylmalonyl-CoA (U.S. patent application Serial Nos. 09/687,555, 



-10- 



wo 2004/018635 



PCT/US2003/026413 



filed October 13, 2000; 09/798,033, filed February 28, 2001; and 10/087,451, filed 
February 28, 2002; each of which is incorporated herein by reference). 
[0032] In another aspect, vectors useful for introducing genes into host cells 

containing an Mx9 integration site are provided. In a particular aspect, vectors of the 
present disclosure include bacteriophage vectors comprising DNA encoding an integrase 
protein, an Mx9 phage attachment site {attP\ and another gene. In an embodiment, the 
vector is aplasmid vector. In a related aspect, the invention provides a vector selected 
fi-om the group consisting of pKOS35-93, pKOS35-l 17.9.7, pKOS249-12, pKOS249-23, 
and pKOS249-3 1 . In one aspect of the invention, an Mx9 transformation system is used 
to introduce DNA into a host chromosome. 

[0033] In related aspects, the invention provides a method of transforming a 

bacterial host cell, said method comprising the steps of a) introducing a first gene into a 
bacteriophage Mx9 transformation system, said system comprising a second gene 
encoding an integrase protein {int) and an attachment site {attP)\ b) introducing said 
bacteriophage Mx9 transformation system to a host cell that contains one or more 
integration sites {attB) located in the DNA of said host cell; and c) trknsforming said host 
cell with said first gene by site-specific recombination at the one or more attB sites. In an 
embodiment, the one or more attB sites are comprised of attBl (SEQ ID N0:3), attB2 
(SEQ ID N0:4), or a combination thereof. In an embodiment, the cells are Myxococcus 
cells, for example epothilone-producing cells. In an embodiment, the epothilone is 
selected from the group consisting of epothilone C and D. In some embodiments, the 
first gene is selected fi-om the group consisting of prpE^ accA^pccB, matB and matC 
genes. In an embodiment of the invention, the attB and attP sites are comprised of 
identical sequences, which may be identical 42 base pair sequences corresponding to 
nucleotides 1394-1435 of SEQ ID N0:1. In an embodiment, the attB site is located 
within the 5 ' region of the tRNA®'^ gene. In an embodiment of the method, DNA firom 
said attR site is deleted upon transformation of said host cell. In an embodiment, the 
gene encoding an integrase protein is altered upon transformation of said host cell 
[0034] The invention also provides a transformed bacterial host cell produced by 

an aforementioned method. In an embodiment, the host cell produces an epothilone 
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selected from epothilone A, B, C, and D, Optionally, the first gene is selected from the 
group consisting of prpE, accA.pccB, matB and matC genes. 

[0035] In an aspect, the invention provides a method of transforming a bacterial 

host cell that lacks a bacteriophage Mx9 integration site {attE) to improve polyketide 
expression, said method by a) transforming a host cell with a transposon vector 
comprising inverted repeat sequences and a nucleotide sequence comprising a 
bacteriophage Mx9 integration site {attB), whereby the transposon vector transposes into 
the DNA of said cell; b) introducing a first gene to a bacteriophage Mx9 transformation 
system, said system comprising a second gene encoding an integrase protein {int) and an 
attachment site {attP)\ c) introducing said bacteriophage Mx9 transformation system to 
the host cell; and d) transforming said host cell with said first gene by site-specific 
recombination at said attB site. According to this method, the host cells may be 
Sorangium cells, Myxococods cells, Pseudomonas cells, or Streptomyces cells as well as 
others. In embodiments, the host cells produce epothilone selected firom epothilone A, B, 
C, and D and/or the first gene is selected firom the group consisting of prpE, accA,pccB, 
matB and matC genes and/or the attB site comprises flanking sites attR and attL, and said 
integrase protein, when expressed, is an enzyme that facilitates said site-specific 
recombination through binding^o attB and attP sites. The invention fiirther provides a 
transformed bacterial host cell produced by this method, which optionally may produce 
an epothilone selected firom epothilone A, B, C, and D. 

[0036] The invention also provides a bacteriophage Mx9 vector comprising DNA 

encoding an integrase protein, an Mx9 phage attachment site (attP), and another gene. 

Experimental Results and Discussion 
Materials and Methods 

[0037] Bacteria, Phage, and plasmids. DZl is a nonmotile strain of M. xanthus 

and was used for plating Mx9 and for characterization of the Mx9 attachment sites (12). 
DK816 is the natural M xanthus isolate lysogenic for Mx9 (9). M xanthus strains were 
grown in CYE medium (1) or 1% CTS (1% casitone, 0.2% MgS04-7H20, 50 mM 
HEPES pH 7.6). Phleomycin (Cayla) was used at a concentration of 30 ^ig/ml. The Mx9 
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phage was reisolated from DK816 by growing a culture to stationary phase, pelleting the 
cells, and plating dilutions of the supernatant onto DZl, High titer stocks of Mx9 were 
made by coring a plaque and placing it in phage buffer (10 mM MOPS [pH7.6], 4 mM 
MgCl2, 2 mM CaCh). The eluted phage were diluted and mixed with 0.5 ml of DZl in 
early stationary phase. After incubating the cells and phage at room temperature for 20 
minutes, 2.5 ml of top agar was added and the suspension was poured onto phage plates 
(1% BBL trypticase, 0.1% MgS04-7H20, 1% agar, 10 mM MOPS pH 7.6). The plates 
that gave confluent lysis after 2 days of incubation at SC'C were overlayed with 5 ml of 
phage buffer and incubated at 4°C overnight. The eluted phage were stored at 4°C. Phage 
stocks greater than 1x10^ pfu/ml were obtained with this method. Plasmids used are 
described in Table 1. 



Table 1 

Plasirdd Characteristics 

pKOS35-117.9,9 amp^ kan^ colEI, 4.6 kb fragment from Mx9 

pKOS139-29 amp^ colEI, Pt7Ai Mx8 int attP- 

pKOS139-47 tc^ pl5A, Tmgi lacZ, Mx8 attF 

pKOS178-86 tc^, pl5 A, P^/m lacZ, Mx8 attP 

pKOS178-177 tc^, pl5A, PpiiA lacZ, Mx9 int attP 

pKOS178-188 tc^ pl5A, Pmgi lacZ, Mx9 int attP 

pKOS249-31 amp"^ bleo"^ colEI, Pt7Ai Mx9 int attP 

[0038] Isolation of phage DNA. The phage from a high titer stock were pelleted 

by centrifuging in an SS-34 rotor at 28,000 rpm for 3 hours and then resuspended in TE 
(10 mM Tris [pH7-6] 1 mM EDTA). The phage proteins were removed by extracting 
twice with phenol and twice with phenol/chloroform/isoamylalcohol. The DNA was 
precipitated and resuspended in TE. 

[0039] Isolation and sequence of the phage attachment site. To isolate the phage 

attachment site, phage DNA was partially cleaved with HinPl and the fragments were 
ligated into pKOS35-93 cleaved with Accl. The plasmid pKOS35-93 is pBluescriptH 
SK+ with the kanamycin resistance from Tn5 ligated into the Smal and EcoKL sites. One 
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plasmid, pKOS35-l 17.9.7, integrated efficiently into the chromosome. The insert from 
this plasmid was sequenced 

[0040] Isolation of the bacterial attachment site. The bacterial attachment site 

(attB) was isolated by electroporating pKOS35-l 17.9.7 into DZl, making cliromosomal 
DNA, and then recovering the plasmid with flanking chromosomal DNA. Six kanamycin 
resistant colonies were picked and chromosomal DNA was prepared from each. The 
DNA was cleaved with either Pstl orMoI, ligated, and then transformed into E. coll 
Three colonies from each of the electroporations were picked and the recovered plasmids 
were cleaved with Pstl or Xhol. One plasmid from each was sequenced using either 
primer 183-66.3 (GAAGGAGGCACCATGCACGG [SEQ ID N0:8] or 183-66.4 
(CTCACTGAGAGTGAAGCCGC [SEQ ID N0:9]). 

[0041] PGR amplification of the Mx9 attB. Primers were designed to PGR 

amplify attSl and attB2. Primers 183-99.4 (GGAGGTGCGGGAGGGGCGGA [SEQ ID 
NO:10]) and 183-99.6 (TGCCAGGGGTTAGGGCTTG [SEQ ID N0:11]) were used to 
amplify a 285 bp attBl fragment and 183-99.5 (TATCCCAGCAACCGCCGGAG [SEQ 
ID N0:13]) with primer 183-99.4 was used to amplify a 373 bp attB2 fragment. To 
amplify the native attB J site primers 183-99.6 and 249-179.7 
(CAGCACGGGTGCAGCAAC [SEQ ID NO: 14]) were used to amplify a 250 bp 
fragment. PGR reactions were done using chromosomal DNA from DZl and the 
FailSafe™ PGR system from Epicentre. Amplification conditions were 96°C for two 
minutes and then 30 cycles of 94°C 30 seconds, 55°C for 1 minute, 72'>C for 2 minutes. 
[0042] Construction of a minimal integration plasmid. The int gene was PGR 

amplified from pKOS35-l 17.9.7 using the primers 1 1 1-74.4 
(GCGAATTGGCTGAGGGCAGCGGGTGATT [SEQ ID N0:15]) and 11 1-82.5 
(GCGGATGGGGGTCAGGGGTGCGTGGGACGGG [SEQ ID N0:16]). PGR 
amplification conditions were those previously described. The amplified fragment was 
ligated into the EcoRV site of pLitmus 28 (New England Biolabs) to create pKOS249-12. 
The int gene was removed from this plasmid by cleaving with EcoRI, the DNA ends were 
made blunt with the Klenow fragment of DNA polymerase followed by cleaving with 
Ncol. The fragment was ligated with pUHE24-2B (3) that was cleaved with Pstl, the 
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DNA ends were made blunt with the Klenow fragment of DNA polymerase I and cleaved 
with Afcol. The resulting plasmid, pKOS249-23, contains the Uu gene under the control of 
the £. coli phage T7 Al promoter that has been engineered to contain 2 Lad binding sites 
to repress transcription. The bleomycin resistance gene was added to this plasmid by 
isolating the bleomycin resistance gene jBrom pKOS 1 83-1 1 2 as a BarnRl to HinAlll 
fragment, the DNA ends were made blunt with the Klenow fragment of DNA polymerase 
I and ligating it with pKOS249-23, which was cleaved with Xlwl and the DNA ends were 
made blunt with the Klenow fragment of DNA polymerase I. This plasmid is designated 
pKOS249-31. 

[0043] p-gaiactosidase assays. Seed cultures of two isolates for each integration 

site were grown in 1% CTS (5 ml) to mid to late log phase. To start the assay cultures, 35 
ml of CTS was inoculated with 1 ml of seed culture at an ODeoo of 0.073. p-galactosidase 
assays were performed by removing an aliquot of cells and adding them to Z buffer for a 
combined volume of 1 ml. The cells were lysed by adding one drop of 0.1% SDS, two 
drops of chloroform, and vortexing the sample for 5 seconds. The assay was initiated by 
the addition of 0.1 ml of O-nitrophenyl p-D-galactopyranoside (8 mg/ml) and mixing. 
The reactions were stopped by the addition of 0.5 ml of 1 M Na2C03. The ODeoo of the 
cell culture and the OD420 of the enzyme reactions were determined using a SpetraMax 
250 plate reader. Miller units were determined as previously described (1 0). 
[0044] Accession numbers. The Mx9 sequence has been assigned the accession 

number AY247757. The accession numbers for attBl and attB2 are AY297770 and 
AY297771, respectively. 

[0045] Identification of the Mx9 int and attachment site. To identify the int gene 

and attachment site, a library of 5-8 kb fragments of Mx9 was made, and a clone that was 
able to integrate into the M xanthus chromosome was identified. The insert in this 
plasmid, pK0S3 5-1 17.9.7, was sequenced. Five complete and one partial open reading 
frames (orf) were identified in the 4.6 kb fragment (Fig. 1). Orf 1 was the only reading 
frame that showed amino acid similarity with other known integrase genes, and therefore 
was given the gene designation int. The other orfs resembled orfs from Mx8; orf 2, or£3, 
orf4, orfS, and orf6 showed similarity to P15, P14, P16, P17, and P18, respectively from 
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Mx8. From the degree of similarity of these orfs between, it appears that Mx8 and Mx9 
are very similar phages. 

[0046] The Mx9 //// gene was examined for sequences that would indicate an 

attachment site. Analysis revealed a DNA segment within the ini gene (nt 1397-1428 
(Figure 2)) that had sequence similarity to tRNA^^ from various organisms. Since Mx8 
integrates into the tRNA^** gene of M xanthus, the sequence that showed similarity with 

was predicted to serve as the site of integration for Mx9. 
[0047] To test this prediction, chromosomal DNA from six integrants containing 

pKOS35-l 17.9.7 were cleaved with restriction enzymes, ligated, and transformed into E, 
coli to recover the plasmid along with flanking chromosomal DNA, Sequencing, using 
primers adjacent to the proposed attachment site, revealed that the point of recombination 
was indeed that of the putative tRNA^'^ . Furthermore, the sequence of flanking 
chromosomal DNA showed that there were two attB sites. It appeared from the nimiber 
of integrants at each site, 3 for attBl and 3 for attB2, that both served equally well as the 
insertion site (Figure 3). 

[0048] Structure of the two attB sites. Figure 3 shows 360 bp from each of the 

attB sites. Both have a common 42 bp core sequence that is also found within the Mx9 int 
gene. In addition, there are 22 bp 5' to both attB sites that are identical in 21 positions. 
There is a putative inverted repeat that may play a role in Megrase protein binding at the 
attB and attP (Fig. 3b). The site of integration within attB2 lies in the 5* end of tRNA^'^ 
gene, which is underlmed in Figure 3b. However, the sequence of attBl does not contain 
a complete tRNA^*^ gene. Figure 4 shows the predicted foldmg of this segment of attB2 
into a corresponding tRNA. 

[0049] Analysis of the attR and attL half-sequences for both attB sites reveals the 

two attR are identical whereas the attL differ. This is also the case with the two Mx8 attB 
sites (7). Plasmids containing the Mx8 int gene preferentially integrate at attBl, and this 
integration often is accompanied by a deletion between attBl and attB2 (8). 
[0050] To determine if the identical attR sites are due to the presence of two attB 

sites containing with identical attR sites or due to the deletion of the DNA between the 
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two attB sites after integration into one of them, PGR analysis was performed using either 
primer pair 183-99.4 and 183-99.6 for attB I or 183-99.4 and 183-99.5 for atiB2, 
10051] A PGR fragment was detected using primers specific for attB2 but none 

was detected using primers specific for attBl (data not shown). This suggests that a 
deletion may occur upon mtegration of attBl but to be certain that the lack of a PGR 
product was not due to the failure to PGR amplify the DNA fragment, further 
experiments were performed. 

[0052] Next, the genomic sequence ofM xanthus strain DK1622, generated by 

Monsanto and available at the TIGR web site, was examined for the two attB sites 
(www.TIGR.org). The attB2 sequence was almost identical to that previously identified 
(Fig. 3B) but only the first 178 bp of the attBl site from Figure 3 A was present before the 
sequence diverged. Using this sequence information for attBl, a primer was designed that 
was approximately 100 bp downstream from the point at which the sequence diverged 
(249-179.7). Using this primer along with 183-99.6, the one 5' to the attBl site, and DZl 
genomic DNA, a PGR product of approximately 250 bp was isolated and sequenced. The 
PGR product was identical to that obtained from the DK1622 genomic sequence (Fig 
3C). Analysis of this sequence reveals that only 16 bp of the 42 bp core att site are 
present in the native attBl site. 

100531 Final proof that a deletion does occur between attBl and attB2 is shown in 

Figure 5. Using the primer pair 183-99.4 and 183-99.5, the ones that amplify the attB2 
site, PGR amplification was performed using genomic DNA from the wild type strain or 
strains harboring a plasmid integrated at either attBl or attB2. Using chromosomal DNA 
from DZl, a strain with no plasmids integrated at either attB site, a 372 bp PGR product 
containing the attB2 site was detected in lane 2 figure 5. Two strains that contain 
insertions at attB2, lanes 5 and 6 (Fig. 5) do not give the 372 bp band and should not 
amplify the attB2 due to the presence of a plasmid integrated at that site. If a deletion 
does occur between atfBl and attB2y then there should be no detectable amplification of 
attB2 when a plasmid integrates at attBl. Lanes 3 and 4 (Fig. 5) shows that no attB2 PGR 
product is detected, indicating a deletion of DNA between attBl and attB2 when an 
integration occurs at attBl. 
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(0054] Integration results in the alteration of the carboxy temiinus of the Mx9 Int 

protein. Because attP lies within the int gene, integration into the chromosome should 
alter the 3' end of int gene is altered. From the 1160 bp of attR that has been sequenced, 
no stop codon has been identified (data not shown). Thus 70 amino acids from Int should 
be removed and more than 389 amino acids should be added to the Int protein that is 
synthesized after integration into the chromosome. These additional amino acids 
presumably will reduce the enzymatic activity of Int because the IntX protein of Mx8 has 
lost 1 12 residues and added 13 amino acids, and is a less active at site specific 
recombination (8). 

[0055] Mx9 Int is the only phage protein required for integration. To determine 

whether int is necessary and sufficient for integration, the int gene was PGR amplified 
and ligated into an E. coli expression vector that uses an engineered phage T7 Al 
promoter. The plasmid pKOS249-31, when electroporated into DZl, integrated 
efficiently into the chromosome; approximately IxlO"^ colonies were obtained per 
microgram of DNA. Thus, the Mx9 int gene is the only phage encoded protein required 
for integrative recombination into the bacterial chromosome. 

[0056] Transcription firom the pilA and the mgl promoters integrated at the two 

Mx9 attB sites. To find a phage attachment site on the M xanthus chromosome that 
supported efficient expression of genes firom a variety of promoters, fiisions of lacZ to the 
mgl or pilA promoters were constructed and transcription fi-om these promoters at the two 
Mx9 attB, the Mx8 attB, and the native chromosomal location was analyzed. Figure 6A 
shows the expression level of the pilA promoter (Pp//^) at the four different locations. 
Surprisingly, there was little transcription when the P^a^ plasmid was integrated by 
homologous recombination at the pilA location (pKOSl 78-86). This suggests that there 
may be a deletion in the pilA promoter region that abolishes activation of the pilA 
promoter in DZl since there was no expression in several isolates that were examined. As 
we have observed previously, little transcription from Pp/w is seen when integrated at 
Mx8 attB site (pKOS178-86 + pKOS139-29). However, the Mx9 sites show high levels 
of transcription from P^/m (pKOS178-177) and they are fairiy similar at both sites, 
although attB2 had high variability of expression from the two isolates examined. In 
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addition, the regulation at both sites was similar; transcription from Ppm increased during 
late log and stationary phases. 

[00S7] The results of transcription from the mgl promoter (Pw,^/) are shown in 

Figure 6B. Transcription from P^^^/at the two Mx9 attB (pKOS 178-188) sites was better 
than at the Mx8 site (pKOS139-47 + pKOS139-29) but not as high when integrated by 
homologous recombination at the chromosomal mgl location (pKOS 139-47). However, 
this lower expression at the two Mx9 sites may be vector dependent. Using a plasmid 
that contained only the atiP site and integrating it by supplying the int gene in trans, P^^/ 
functions just as well at both Mx9 sites as it does at the chromosomal mgl location (see 
Fig. 6C). In this experiment, a plasmid was constructed that contained the mgl promoter 
fused to lacZ and harbored only the Mx9 attP site. This plasmid was integrated into the 
Mx9 attBl or attB2 by co-electroportating it with a second plasmid that expressed the int 
gene. P-galactosidase assays with cells containing this plasmid reveals that the levels of 
expression from the mgl promoter is as good, if not better, than the native mgl 
chromosomal location. Thus expression from the mgl promoter at the Mx9 attB locations 
may be vector dependent. The conclusion from these studies indicates that the Mx9 attB 
sites are good for expression of foreign or native genes. 

[0058] The Mx9 int gene and attachment site have been identified, along with the 

site of integration into the M xanthus chromosome. The analysis reveals remarkable 
similarity to the int gene and attachment site from the myxophage Mx8 (7, 8, 1 1). Both 
contain the attP within the int gene and integrate within a tRNA gene. They have two 
attB sites and it appears that adjacent chromosomal DNA is deleted when integration 
occurs at one of the sites. For both, Int is the only phage-encoded protein needed for 
integration. 

[0059] A difference between the Mx8 and Mx9 phage integration systems is the 

length of their respective core sequences. The core sequence for Mx8 integration is 
smaller, composed of 29 bp. The attB 2 site has two nucleotides that differ at one end, 
which may account for the preference of Mx8 for inserting at attBL The att core region 
for Mx9 is 42 bp, but of the two integration sites only attB2 contains all 42 bases. The 
attB J site contains only 16 bases of the core sequence. The lack of a complete core 
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sequence in attBl may explain why there is always a deletion between attBl and attB2 
when integration occurs at attBL The Int protein may bind to the inverted repeat within 
the 42 bp core. Binding of the X Int protein to its att sites has been shown (5). Since the 
attBJ contains half of the inverted repeat, only half of the necessary protein complex can 
form, but once it has assembled, it may interact with the complementary half of proteins 
from attB2 to allow for integration. This would result in a looping out of the DNA 
between attBl and attB2, and its subsequent loss upon integration of DNA. 
[0060] In our PGR reactions to detect attBl with primers 1 83-99.4 & 1 83-99.6, 

the conditions were such that if the distance between attBl and attB2 was less than 2 kb, 
then a PGR product should have been detected. Since no product was observed, this 
suggests that the distance between the two sites is greater than 2 kb. Analysis of the 
DKI622 sequence shows that the two attB sites are 6.7 kb apart. Partial analysis of this 
sequence shows a couple open reading frames that have sequence similarity to 
transposase genes, suggesting the presence of a transposon. The other reading frame that 
was identified reveals high sequence similarity to protems of unknown functions. Clearly, 
the open reading frames encoded in between the two attB sites are not critical for growth 
under laboratory conditions since strains with integrations at attBl have no visible growth 
defects. 
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[0062] Numerous modifications may be made to the foregoing systems without 

departing from the basic teachings thereof. Although the present invention has been 
described in substantial detail with reference to one or more specific embodiments, those 
of skill in the art will recognize that changes may be made to the embodiments 
specifically disclosed in this application, yet these modifications and improvements are 
within the scope and spirit of the invention, as set forth in the claims which follow. All 
publications and patent documents cited in this specification are incorporated herein by 
reference as if each such publication or document was specifically and individually 
indicated to be incorporated herein by reference. 

[0063] Citation of the above publications or documents is not intended as an 

admission that any of the foregoing is pertinent prior art, nor does it constitute any 
admission as to the contents or date of these publications or documents. 
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CLAIMS 



We claim: 



1 . A method for modification of a DNA of a bacterial cell comprising in its genome 
a first attachment site recognized by a protein with Mx9 integrase activity, comprising 
introducing a Mx9 transformation system into the cell, said system comprising 

a) a gene encoding a protein with Mx9 integrase activity protein operably 
linked to a promoter active in the host cell, and 

b) a DNA vector comprising a second attachment site recognized by the 
integrase protein, which may be the same as the first attachment site. 

2. The method of claim 1 wherein the cell is Myxococcus or Sorangium. 

3. The method of claim 1 wherein the protein has a sequence at least 
substantially identical to SEQ ID N0:2. 

4. The method of claim 3 wherein the protein has a sequence of SEQ ID 



N0:2. 



5. The method of claim 4 wherein the protein is encoded by a gene 

comprising the sequence of SEQ ID N0:1. 



6. 



The method of claim 1 wherein said fust attachment site comprises SEQ 



IDN0:5. 



7. 



The method of claim 6 wherein said first attachment site is attB2. 



8. 



The method of claim 1 wherein said second attachment site comprises 



SEQIDN0:5. 
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9. The method of claim 3 wherein said first attachment site has been 
recombinantly introduced into the cell genome. 

10. The method of claim 1 wherein said DNA vector further comprises an 
exogenous gene. 

1 1 . The method of claim 1 0 wherein the exogenous gene is selected from the 
group consisting of prpE^ accA,pccB, matB^ matC and beta-galactosidase genes. 

12. The method of claim 6 wherein the first and second attachment sites are 
comprised of identical sequences. 

13. The method of claim 2 wherein the cell is Myxococcus xanthus. 

14. The method of claim 13 wherein the cell produces an epothilone. 

15. The method of claim 14, wherein the epothilone is selected from the group 
consisting of epothilone C and D, 

16. A bacterial host cell produced by the method of claim 10. 

17. The cell of claim 16 wherein that produces an epothilone selected fi:om 
epothilone A, B, C, and D. 

18. The cell of claim 17, wherein said exogenous gene is selected fi'om the 
group consisting of prpE^ accA.pccBy matB and matC genes. 
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CCA GGG 


AAA 


TCA 


ACA 


CAC 


CTT 


CGT TTT TGA ACC GAA 


CTA 


TTA 


CTC 


AAA 


GGC CTG AGA 


CGG 


GGT CCC 


TTT 


AGT 


TGT 


Val 


Glu 


Ala Lys Thr Tzp Leu 


Asp 


Asn 


Glu 


Asn 


Pro Asp Ser 


Ala 


Pro Gly 


Lys 


Ser 


Thr 






130 140 






150 




160 




170 






180 


ATA 


ATT 


GAT TGG TAT GCA AAA 


TTC 


AAG 


CGT 


GGT 


GAA ATG AGC 


ACG 


GAG GAC 


GGT 


GAA 


CGC 


TAT 


TAA 


CTA ACC ATA^CGT TTT 


AAG 


TTC 


GCA 


CCA 


CTT TAC TCG 


TGC 


CTC CTG 


CCA 


CTT 


GCG 


He 


He 


Asp Trp Tyr Ala Lys 


Phe 


Lys 


Arg 


Gly 


Glu Met Ser 


Thr 


Glu Asp 


Gly 


Glu 


Arg 






190 200 






2io 




220 




230 






240 


AGT 


GGA 


CGC CCG AAA GAG GTG 


GTT 


ACC 


GAC 


GAA 


AAC ATC AAA 


AAA 


ATC CAC 


AAA 


ATG 


ATT 


TCA 


CCT 


GCG GGC TTT CTC CAC 


CAA 


TGG 


CTG 


CTT 


TTG TAG TTT 


TTT 


TAG GTG 


TTT 


TAC 


TAA 


Ser 


Gly 


Arg Pro Lys Glu Val 


Val 


Thr 


Asp 


Glu 


Asn He Lys 


Lys 


He His 


Lys 


Met 


He 






250 260 






270 




280 




290 






300 


TTG 


AAT 


GAC CGT AAA ATG AAG 


TTG 


ATC 


GAG 


ATA 


GCA GAG GCC 


TTA 


AAG ATA 


TCA 


AAG 


GAA 


AAC 


TTA 


CTG GCA TTT TAC TTC 


AAC 


TAG 


CTC 


TAT 


CGT CTC CGG 


AAT 


TTC TAT 


AGT 


TTC 


CTT 


Leu 


Asn 


Asp Arg Lys Met Lys 


Leu 


He 


Glu 


He 


Ala Glu Ala 


Leu 


Lys He 


Ser 


Lys 


Glu 






310 320 






330 




340 




350 






360 


CGT 


GTT 


GGT CAT ATC ATT CAT 


CAA 


TAT 


TTG 


GAT 


ATG CGG AAG 


CTC 


TGT GCA 


AAA 


TGG 


GTG 


GCA 


CAA 


CCA GTA TAG TAA GTA 


GTT 


ATA 


AAC 


CTA 


TAC GCC TTC 


GAG 


ACA CGT 


TTT 


ACC 


CAC 


Atq 


Val 


Gly His He He His 


Gin 


Tvr 


Leu 


Asp 


Met Arg Lys 


Leu 


Cys Ala 


Lys 


Trp 


Val 






370 380 






390 




400 




410 






420 


CCG 


CGC 


GAG CTC ACA TTT GAC 


CAA 


AAA 


CAA 


CAA 


CGT GTT GAT 


GAT 


TCT GAG 


CGG 


TGT 


TTG 


GGC 


GCG 


CTC GAG TGT AAA CTG 


GTT 


TTT 


GTT 


GTT 


GCA CAA CTA 


CTA 


AGA CTC 


GCC 


ACA" 


AAC 


Pxo 


Atq 


Glu Leu Thr Asn Asp 


Gin 


Lvs 


Gin 


Gin 


Arg Val Asp 


Asp 


Ser Glu 


Arg 


Cys 


Leu 






430 440 






450 




460 




470 






480 


CAG 


CTG 


TTA ACT CGT AAT ACA 


CCC 


GAG 


TTT 


TTC 


CGT CGA TAT 


GTG 


ACA ATG 


GAT 


GAA 


ACA 


GTC 


GAC 


AAT TGA GCA TTA TGT 


GGG 


CTC 


AAA 


AAG 


GCA GCT ATA 


CAC 


TGT TAC 


CTA 


CTT 


TGT 


Gin 


Leu 


Leu Thr Arg Asn Thr 


Pro 


Glu 


Asn 


Phe 


Arg Arg Tyr 


Val 


Thr Met 


Asp 


Glu 


Thr 






490 500 






510 




520 




530 






540 


TGG 


CTC 


CAT CAC TAC ACT CCT 


GAG 


TCC 


AAT 


CGA 


CAG TCG GCT 


GAG 


TGG ACA 


GCG 


ACC 


GGT 


ACC 


GAG 


GTA GTG ATG TGA GGA 


CTC 


AGG 


TTA 


GCT 


GTC AGC CGA 


CTC 


ACC TGT 


CGC 


TGG 


CCA 


Trp 


Leu 


His His Tyr Thr Pro 


Glu 


Ser 


Asn 


Arg 


Gin Ser Ala 


Glu 


Trp Thr 


Ala 


Thr 


Gly 






550 560 






570 




580 




590 






600 


GAA 


CCG 


TCT CCG AAG CGT GGA 


AAG 


ACT 


CAA 


AAG 


TCC GCT GGC 


AAA 


GTA ATG 


GCC 


TCT 


GTT 


CTT 


GGC 


AGA GGC TTC GCA CCT 


TTC 


TGA 


GTT 


TTC 


AGG CGA CCG 


TTT 


CAT TAC 


CGG 


AGA 


CAA 


Glu 


Pro 


Ser Pro Lys Arg Gly 


Lys 


Thr 


Gin 


Lys 


Ser Ala Gly 


Lys 


Val Met 


Ala 


Ser 


Val 
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610 




620 




630 






640 




650 






660 


TTT 


TTC 


GAT GCG 


CAT 


GGA ATA 


ATT TTT 


ATC 


GAT 


TAT 


CTT GAG 


AAG 


GGA AAA 


ACC 


ATC 


AAC 


AAA 


AAG 


CTA CGC 


GTA 


CCT TAT 


TAA AAA 


TAG 


CTA 


ATA 


GAA CTC 


TTC 


CCT TTT 


TGG 


TAG 


TTG 


Asn 


Phe 


Asp Ala 


His 


Gly lie 


He Asn 


lie 


Asp Tyr Leu Glu Lys 


Gly Lys Thr 


He 


Asn 






670 




680 




690 






700 




710 






720 


AGT 


GAC 


TAT TAT 


ATG 


GCG TTA 


TTG GAG 


CGT 


TTG 


AAG 


GTC GAA 


ATC 


GCG GCA 


AAA 


CGG 


CCC 


TCA 


CTG 


ATA ATA 


TAC 


CGC AAT 


AAC CTC 


GCA 


AAC 


TTC 


CAG CTT 


TAG 


CGC CGT 


TTT 


GCC 


GGG 


Ser 


Asp 


Tyr Tyr 


Met 


Ala Leu 


Leu Glu Arg 


Leu Lys 


Val Glu 


He 


Ala Ala Lys 


Arg 


Pro 










740 




750 






760 




770 






780 


CAT 


ATG 


AAG AAG 


AAA 


AAA GTG 


TTG TTC 


CAC 


CAA 


GAC 


AAC GCA 


CCG 


TGC CAC 


AAG 


TCA 


TTG 


GTA 


TAG 


TTC TTC 


TTT 


TTT CAC 


AAC AAG 


GTG 


GTT 


CTG 


TTG CGT 


GGC 


ACG GTG 


TTC 


AGT 


AAC 


His 


Met 


Lys Lys 


Lys Lys Val 


Leu Phe His 


Gin Asp Asn Ala Pro 


Cys His Lys 


Ser 


Leu 










800 




810 






820 




830 








AGA 


ACG 


ATG GCA 


AAA 


ATT CAT 


GAA TTG 


GGC 


TTC 


GAA 


TTG CTT 


CCC 


CAC CCA 


CCG 


TAT 


TCT 


TCT 


TGC 


TAC CGT 


TTT 


TAA GTA 


CTT AAC 


CCG 


AAG 


CTT 


AAC GAA 


GGG 


GTG GGT 


GGC 


ATA 


AGA 


Arg 


Thr 


Met Ala 


Lys 


He His 


Glu Leu Gly 


Phe 


Glu 


Leu Leu 


Pro 


His Pro 


Pro 


Tyr 


Ser 










860 




870 






880 




890 








CCA 


GAT 


CTG GCC 


CCC 


AGC GAC 


TTT. TTC 


TTG 


TTC 


TCA 


GAC CTC 


AAA 


AGG ATG 


CTC 


GCA 


GGG 


GGT 


CTA 


GAC CGG 


GGG 


TCG CTG 


AAA AAG 


AAC 


AAG 


AGT 


CTG GAG 


TTT 


TCC TAC 


GAG 


CGT 


CCC 


Pro 


Asp 


Leu Ala 


Pro Ser Asp 


Asn Phe 


Leu 


Phe 


Ser Asp Leu Lys 


Arg Met 


Leu 


Ala 


Gly 










920 




930 






940 




950 






960 


AAA 


AAA 


TTT GGC 


TGC 


AAT GAA 


GAG GTG 


ATC 


GCC 


GAA 


ACT GAG 


GCC 


TAT TTT 


GAG 


GCA 


AAA 


TTT 


TTT 


AAA CCG 


ACG 


TTA CTT 


CTC CAC 


TAG 


CGG 


CTT 


TGA CTC 


CGG 


ATA AAA 


CTC 


CGT 


TTT 


Lys 


Lys 


Asn Gly 


Cys 


Asn Glu 


Glu Val 


He 


Ala 


Glu 


Thr Glu 


Ala 


Tyr Asn Glu 


Ala 


Lys 






970 




980 




990 






1000 




1010 




1020 


CCG 


AAG 


GAG TAC 


TAC 


CAA AAT 


GGT ATC 


AAA 


AAA 


TTG 


GAA GGT 


CGT 


TAT AAT 


CGT 


TGT 


ATC 


GGC 


TTC 


CTC ATG 


ATG 


GTT TTA 


CCA TAG 


TTT 


TTT 


AAC 


CTT CCA 


GCA 


ATA TTA 


GCA 


ACA 


TAG 


Pro 


Lys 


Glu Tyr 


Tyr Gin Asn 


Gly He Lys 


Lys 


Leu Glu Gly Arg 


Tyr Asn Arg 


Cys 


He 






1030 




1040 






















OCT 


CTT 


GAA GGG 


AAC 


TAT GTT 


GAA TAA 




















CGA 


GAA 


CTT CCC 


TTG 


ATA CAA 


CTT ATT 




















Ala 


Leu 


Glu Gly 


Asn Tyr Val 


Glu *** 





















